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ABSTRACT 

The monograph defines and discusses single subject 
research designs in special education* Advantages of this approach 
with measuring change in handicapped children are noted. 
Observational techniques (event recording, duration recording, 
interval recording, and time sampling) are described, along with 
three types of single subject evaluation procedures (A-B, reversal, 
and multiple baseline) • A final section considers the presentation 
and interpretation of results. The authors conclude that the 
methodology of the single subject approach is simple and wall salted 
to the individualized education program requirement of P.L.> 94-142, 
the Education for All Handicapped Children Act. (CL) 
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In view of demands for greater teacher accountability in the education of 
handicapped children, the use of empirically-based methods for documenting child 
chai^ge represents a valuable addition to educational programs. Since the relative 
impact of various instructional methods may not always be readily apparent, 
techniques for evaluating the impact of instruction on tlie individual child are 
necessary. Single subject evaluation procedures, the most appropriate technique to 
meet these needs, are particularly applicable to special education classrooms 
including handicapped preschool programs, resource rooms, and TMR and EMR 
programs. 

Baer, Wolf, and Risley (1968) discuss single subject procedures in the context 
of "applied behavior analysis." This refers to the systematic study of the 
relationship between specific behaviors of the individual child and t!ie specific tech- 
niques which are employed to change the child's behaviors, Lovitt (1975) describes 
five attributes of single subject evaluation procedures: (1) the direct measurement 
of the' child's behavior, (2) the continuous measurement of the child's behavior on 
a daily or near daily basis, (3) an understanding of the idiosyncrasies of the 
child's behavior, (4) the demonstration of a functional relationsliip between the 
interventic-T: and the child's behavior, and (5) interventions that are adequately 
described and therefore replicable. As such, single subject evaluation procedures 
are highly compatible with special education activities due to the emphasis placed 
upon the individual child. 

Although group evaluation procedures have traditionally been used in the 
evaluation of treatment effectiveness, single subject evaluation procedures may be 
more appropriate when studying handicapped children for several reasons. 

1. The variability with which the behavior of the individual child occurs may 
be documented. As such, fluctuations in the occurrence of the child's 
behavior may be accounted for on an ongoing basis. 

2. The individual child serves as his own control. Therefore, any changes 
in the child's behavior may be evaluated within the context of the child's 
unique characteristics and behavioral repertoire. Single subject pro- 
cedures show when the child's behavior has changed and also how it has 
changed. Further, single subject procedures document the relative speed 
with which the child acquires new skills and also the manner in which 
these skills were acquired. 

3. The procedures grew out of problems frequently addressed by teachers. 
As such, the impact of instruction upon the individual child may be 
evaluated without disrupting ongoing educational activities. Single 
subject procedures are appropriately used in settings where the primary 
goal is to remediate the child's skill deficits or improve the child's social 
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functioning. Single subject procedures do not necessitate that changes 
occur in the intervention strategy for the child. Rather, they are meant 
to follow tlie course of treatment nnd instruction which may best serve 
the needs of the child. 
^, In single subject procedures the relative Impact of instruction is evaluated 
by visually analyzing the graphically presented data. Therefore, the use 
of statistical procedures are net necessary because t >• significance of the 
intervention is visually apparen*^ 

While single subject evaluation procedures are useful in c :umenting individual 
sources of variance and the relative impact of intervention, there are certain 
limitations which must be considered. The degree to which one may generalize frjm 
the individual child to other children may be limited. As such, one cannot inrer 
that an intervention which is effective with one child will be equally effective with 
another child . Therefore, Leittenberg (1 973) advocates the use of single subject 
evaluation procedures for comparing the different interventions once their 
effectiveness has been established . Thus, single subject designs are useful 
techniques for the evaluation of specific instructional methods, but may not be 
sufficient for the comparison of different instructional methods wliich are already 
effective. 

Group designs involve the comparison of two or more groups of children who 
have received different instructional methods. For example, an educator may be 
interested in determining the impact of one instructional metiiod as compared with 
another. He may choose to divide a set of students into two groups, giving one 
group a particular instructional method while the other group receives another 
instructional method. Based upon some measure (for example, scores on a 
standardized test), the educator may compare the two groups to determine the 
relative effectiveness of the two instructional methods. Through the application of 
statistical procedures, the relative effectiveness of the methods can be identified. 

However, objections to the use of group designs have been based upon a 
variety of criticisms (Bergin S Strupp. 1976; Hersen & Barlow, 1976). They in- 
clude: 

1. Group designs may require that a substantial number of children not 
participate in a given treatment condition. As such, the withholding of 
treatment from the children In the control group raises ethical questions. 

2. Group designs require that a sufficiently large sample of handicapped 
children be included in each group. Due to the extreme variability of the 
skill levels and behaviors found among handicapped children and due to 
the differing needs of each child, it becomes difficult to Identify 
homogeneous groups of children for studies. 

3. Group designs may demonstrate statistically significant differences 
between groups, but the educational significance of the intervention for a 
particular child may be minimal (Bergin, 1966; Chassan, 1967; Sidman, 
1960). The characteristics of the individual child are lost in group 
designs. As such, it becomes difficult for the practitioner to determine 
the extent to which the child in the classroom is similar to those children 
who exhibited change during the group evaluation procedure. 

In special education settings, instruction may be different for each child 
depending upon his unique needs and attributes. Single subject evaluation 
procedures emphasize the child^s uniqueness and focus on differences found in one 
child over time rather than differences between groups of children. Thus, single 
subject evaluation procedures may be more applicable to the study of handicapped 
children than are group procedures. 
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.far this child , 3elf-abusive behavior mr/ be defined as 
-child: IS observed hitting himself on his head with his hand 
" al . Tne precision and accuracy of the definition can then 
twc independent observers simultaneously record the 
self-ibuslve behavior. It Is important to stress that the 
:ne afinition makes it highly unlikely that the same 
definition would be tiriJy appr opriate for different children manifesting self-abusive 
behaviors. 



MEASUREMENT OF THE i :GET BEHAVIOR 



After selecting and r:efining the target behavior, the level at which the 
behavior naturally occurs prior to intervention must be measured. This measure 
wil I serve as a stancard r "baseline" against which 'he effectiveness of the 
intervention procedure - in b- evaluated. 
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JRATION RECORD 



Frequently the occ ^ence or non-occurrenc* of the target behavior is 
secondary to how long lasts. In these situations, duration recording is an 
appropriate recording ter^- ique to use. A stop wa or other timing device may 
be used to measure the : _~ation of the target behav ur. For example, a teacher or 
observer may record thi duration of a temper tantrum each time the behavior 
occurs. Total duration average duration, and tie percent of the observation 
sessirn during which the behavior occurred can be determined using this method. 
Like 3vent recording, this procedure requires that tne target behavior have a fixed 
begirning and end. If an educator is interested in reporting only the total 
duration of the target behavior per sessiqn, then the length of each observation 
session must be identical. However, if an evaluator is interested in reporting the 
average duration or the percent of the session occupied by the target behavior, the 
length of each observation session can vary. 
«» 

INTERVAL RECORDING 

Interval recording is obtained by dividing the observation session into several 
separate but equal time periods. For example., a 5-minute observation session may 
be divided into twenty 15-second intervals. If the target behavior occurs at any 
time during an interval, it is noted as having occurred during that interval. For 
example, if the child is out of his seat at any time during a 15-second interval^ a 
check mark is made in the appropriate interval on the observer's recording sheet. 
However, if a child is out of his seat more than once during the 15-seccnd interval, 
only one check mark is made. The number and percent of intervals during which 
the child engages in the target behavior are reported. Since the procedure 
requires an observer's undivided attention, it may necessitate that someone in 



EKLC 



5 



addition to the teacher is available to do the recording. Typically, the length f 
observation sessions is determined by noting the length of the behavior and settl" g 
the time period so that is is highly unlikely that more than one occurrence of t-e 
target behavior can occu^ within a given time interval. 

TIME SAMPLING 

Time sampling is one of the easiest observational techniques to use in ^~^ 
educational setting because it does not require an observer's undivided attentic-. 
Thus, a teacher can easily use time sampling to measure the target behavior wh: i 
at the same time teaching a class. As with interval recording, the observati^r 
sessions are divided into a number of equal intervals. However, unlike inter, 
recording, the child is observed only at the end of the interval. Therefore, t 
observer records whether or not the behavior is occurring at a specific insta:^:. 
For example, a 3-hour session may be divided into twelve IS-minute intervals. - . 
the end of each 15-minute interval, the observer looks at the child and records 
whether or not the behavior is occurring. For example, a teacher might look a: i 
child every 15 minutes for three hours and record whether or not stie is crying. 
Time sampling provides the teacher with the number and percent of inter-.3is 
sampled during which a child engages in the target behavior. Like inte- va^ 
recording, time sampling can be used when the target behavior does not have a 
clear beginning and end. In addition, although the intervals within eacli se io- 
must be of equal length, the length of the sessions does not need to be equal .h^ 
percent of occurrence is reported. 

INTEROBSERVER AGREEMENT 

As previously noted, it is important that the target behavior is defined sc that 
two observers simultaneously agree on the occurrence of behavior when they 
independently observe the same child. Thus, it is important that the observe-s are 
properly trained in the nnethod of recording and in the definition of the ~arget 
behavior. Although observers may initially observe the behavior in the same 
manner at the beginning of the study, their definition may independently "drift" 
and they may define the behavior differently by the end of the study (Johnson 
Bolstad, 1973; O'Leary & Kent, 1973). As such, perceived changes ur improvement 
in the 'target behavior may in fact be due to a change in how the behavior is 
measured. Consequently, several authors recommend training a minimum of two 
people to" observe and record the behavior and periodically computing interobscrver 
agreement throughout the evaluation (Bijou, Peterson & Ault, 1968; Johnson & 
Bolstad, 1973; McNamara & MacDonough, 1972; O'Leary & Kent, 1973; Reid, 1970; 
Reid & Demaster, 1972; Romanczyk, Kent, Diament & O'Leary, 1 973). 

Interobserver agreement can be computed between pairs of observers at any 
time throughout the course of the study. The most common method of assessing 
interobserver agreement has been that of percent agreement (Johnson & Bolstad, 
1 973; Kelly, 1977; Yelton, Wildman & Erickson, 1977). Percent agrceme-; is 
calculated using the following formula: 



No. of Agreements x 100 = Percent Agreement 

No. of Agreements + No. of Disagreements 



Although percent agreement is an acceptable method for computing interobserver 
agreement, it does have several limitations (Johnson & Dolstad, 1973). Cohen's 
Kappa (19'60) avoids the problems of percent agreement and has therefore been 
recommended for analysis of interobserver agreement (Hartmann, 1977; Yelton, 
Wildman & Erickson, 1977). For most educational evaluators, however, percent 



agrewTient is usually adequate and -nr 
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(O'Leary & Becker, 1967; Hall, Lun 
1969). However, the level of agr&: 
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LENGTH OF THE BASELINE 



ipler to compute. Generally, acceptable 
-)ve 75 percent for classroom behaviors 
3CKSon, 1968; Barrish, Saunders & Wolf, 
::>etvveen observers may differ depending 
:: method of calculating reliability. 



The purpose of a baseline is tc orov:de a standard against which the effective- 
ness of an intervention can be ev-:ua-ed. Therefore, it is important not only to 
adequately define and obtain accurEiie Tneasures of the behavior, but also to ensure 
that baseline measures are taken ovt^r* a sufficient period of time. 

Barlow and Hersen (1973) re3:mmend that baseline data be gathered for a 
minimum of three sessions prior to c^ginning an intervention. At least three data 
points are necessary to reveal the presence of a pattern in the baseline data. For 
example, with a minimum of three prints, an evaluator can determine if there is an 
upward, downward or stable trend in the data. An upward trend in the data 
during the baseline condition gener-aliy means that the occurrenco^ of the target 
behavior is increasing over time (ascending baseline) and a downward trend in the 
data means that the occurrence cf the target behavior is decreasing (descending 
baseline) . In addi tion, a t least three data points are necessary to give an 
investigator an idea of the variability of the target behavior during the baseline 
condition. 

McNamara and McDonough (isn). Wolf and Risley (1971), and Baer, Wolf and 
Risley (1968) agree that baseline measurements continue until stability in the 
baseline data is attained. In other words, they contend that baseline data should 
be gatKiered until there is little change in the occurrence of the behavior between 
sessions. Therefore, if change in the behavior occurs after the Intervention, it 
can be more easily attributed to the intervention. Sidman (1960) recommended that 
in a laboratory, under controlled conditions, rates of behavior should be within a 5 
percent range of the mean. However, as Hersen and Barlow (1976) state, this Is 
often not practical when doing applied research. For example, an investigator may 
not have the time to continue gathering pretreatment data until stability is attained. 
It is also difficult to isolate and therefore eliminate all sources of variance in 
applied settings. 

Although conclusions regarding the effectiveness of an intervention must be 
made carefully, an instructional or treatment procedure is often implemented without 
a stable baseline. Several techniques for dealing with a variable baseline and the 
use of inferential statistics are available (see Hersen & Barlow, 1976). Instructional 
and treatment procedures are also sometimes started when a baseline is either 
ascending or descending. For example, if the intent of the teacher is to increase 
the occurrence of a child's behavior and the baseline is descending, the 
instructional procedure could be initiated. Since a change in the direction of the 
slope of the line is expected (i.e., descending to ascending), the effect of the 
procedure can be evaluated. However, if the intent of the teacher is to increase 
the occurrence of the specified behavior and the baseline is ascending, the 
intervention should not be initiated. The effectiveness of the instructional 
procedure would be difficult to assess as the slope of the line may not be expected 
to change. It would be difficult to attribute improvement to the procedure unless 
there was a marked change in the slope of the line. 



INTRODUCTION OF THE INTERVENTION 



After data have been gathered during the baseline condition, the intervention 
is in'iroduced. The intervention may be an instructional or treatment technique 
which the educator believes will modify the occurrence of the target behavior. Data 



collection continues when the intervention is initiated and throughout the remainder 
of the evaluation. It is important that the data collection procedures are identical 
during the baseline and interverUion conditions. Data must be gatliered at the same 
time of day (Russell & Bernal, 1977) or during the same setting or class throughout 
the evaluation. For example, if a child is observed at 10:00 a.m. during the 
baseline condition, the child must also be observed at 10:00 a.m. after the intro- 
duction of the intervention, unless additional data collected at other times indicate 
that time of day Is not a relevant factor. 

It is also important that factors other than those part of the instructional 
intervention remain the same throughout the evaluation procedure. For example^ it 
may not be wise to change a child's medication, classroom, or teacher during the 
evaluation unless these variables are part of the intervention process. If these 
extraneous factors are controlled and similar data collection procedures are followed 
throughout the evaluation, a change or improvement in the behavior following the 
intervention may be more easily attributed to the intervention. 

A frequent error of educators using single subject procedures is the 
introduction of more than one type of instruction at a time in an attempt to assess 
the interactive effect of the two instructional methods (Hersen & Barlow, 1976). If 
more than one type of instruction at a time is introduced, it is difficult to tell 
which method contributed to changos In the target behavior. Therefore, only one 
factor at a time should be changed throughout the entire evaluation and only after 
stability has been obtained during the previous treatment condition. 



SINGLE SUBJECT PROCEDURES 



There are essentially three types of single subject evaluation procedures — 
A-B, reversal, and multiple baseline — each with advantages and disadvantages. All 
share a common goal of demonstrating the relative impact of a given intervention 
strategy upon a target behavior. All use the subject as his own control and all are 
appropriate for use in clinical /ecucational settings. 



A-B DESIGN 



The simplest single subject procedure is the A-B design. With the A-B 
paradigm, the A represents a baseline condition and the B represents an 
intervention condition. The success of the A-B design is dependent upon obtaining 
an accurate baseline measure. Once the baseline measure is established, an 
intervention may be instituted. During the intervention, data are collected in the 
same manner as during the baseline condition. The impact of the intervention is 
determined by the extent to whicii the occurrence of the target behavior during 
intervention is different from the occurrence of the target behavior under baseline 
conditions. However, the A-B design does have certain limitations which must be 
considered (RIsley & Wolf, 1 972). 

The major assumption of the A-B design is that the baseline measure is 
accurate and predictive of future behavior. It has been argued, however, that 
changes In the target behavior during the intervention phase may be due to factors 
external to the intervention (Wolf & Risley , 1 971 ) . For example, a child may 
receive instruction at home without the teacher's awareness, at the same time that 
an intervention is started at school. The child's learning might change to a greater 
degree than would have occurred with the classroom intervention alone. Therefore, 
when using an A-B design, one cannot assume that a change in the target behavior 
is always attributed to the intervention. Other /actors (e.g., medication, illness) 
may be responsible for the observed change. 
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The extent to which external factors are controlled and precise baseline 
measures are established enhances the conclusions which may be drawn from the 
A-B procedure. The A-3 procedure may be useful when instructional or treatment 
limitations prevent the withdrawal or intermittent application of the intervention. 
For example, an A-B procedure might be useful with a child who exhibits a high 
rate of self-destruclive behavior. Due to the potential for physical harm to the 
child, it would not be appropriate to make unnecessary changes in the intervention 
once it had been initiated. 

Figure 1 is an example of an A-B design used with a handicapped preschooi 
child. As shown, the child's skill level remained stable during the baseline 
condition. Following the start of the instruction, changes occurred in the child's 
performance of the skill. A comparison of the child's performance during instruc- 
tion with the child's performance during baseline reveals that the child acquired 
additional skills during instruction. Whether this increase is solely attributable to 
the instruction is not clear. It is conceivable that the skill increase might have 
occurred without tl le instruction. However , the fact that the behavior was stable 
during baseline may strengthen the teacher's confidence that the increase was due 
to the instruction. 



FIGURE 1. Comparison of baseline and intervention conditions for a fine 
motor skill taught to a handicapped preschool child, utilizing the A-B 
design. 
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REVERSAL DESIGN 



A more favorable single subject procedure is the A-B-A, or "reversal," 
design. In the A-B-A design, a baseline measure is obtained as in the A-B 
design. Following the baseline period (A), an intervention (B) is applied. The B 
phase is followed by a return to baseline or withdrawal (A) of the intervention. If 
the intervention has an impact, the target behavior might be expected to return to 
baseline levels during the reversal phase since the intervention which prompted the 
change in behavior is no longer in effect. Therefore, during the reversal 
condition, the extent to which the behavior "returns" or approximates the level of 
behavior during the baseline condition increases the teacher's certainty that the 
Intervention was effective. If the behavior does not return to baseline levels 
during the reversal phase, several interpretations are possible: (1) the 
intervention was effective bu6 the newly acquired behavior was not 
reversible, (2) the intervention was not effective and the observed change is 
attributable to some outside, uncontrolled variabtois) , or (3) the intervention 
technique and other uncontrolled variables jointly contributed to the observed 
effect. 

When the occurrence of the behavior during the intervention condition differs 
from the behavior during baseline and reversal conditions, the teacher's conclusions 
regarding the efficacy of the intervention are strengthened. As such, the inclusion 
of a reversal phase eliminates many of the objections raised to the A-B design. An 
observed change in the target behavior following the initiation of an intervention 
can be attributed to the intervention rather than to extraneous factors if the 
behavior returns to base'ine levels during the reversal phase* However, the A-B-A 
procedure necessitates that the intervention be interrupted, an option which may 
not always be feasible or ethical. 

Although there is no formula for determining the length of a given intervention 
condition, certain factors must be considered: 

1. The intervention phase should continue until a degree of stabilit y is 
obtained in the occurrence of the target behavior • As such , the aata 
should show a steady improvement in the occurrence of the target 
behavior until the lack of a trend becomes apparent. 

2. The intervention phase should be sufficiently long to assure that 
observed changes in the target behavior are not due to the variability 
with which the behavior naturally occurs. When there is a broad range 
of variability in the occurrence of the behavior, more data points are 
required to demonstrate the Impact of the intervention. Therefore,, the 
intervention should continue until the range of variability during the 
intervention phase does not significantly overlap the range of variability 
during baseline. This is particularly important during the later sessions 
of the intervention condition when the treatment effect should be most 
apparent. For example, a child got out of her seat an average of 18 
times during a half-hour observation session. The range was from 14 to 
24 occurrences during any given session. Following the application of a 
time-out procedure, the average decreased to 13 occurrences with a range 
of variability from 10, to 18 occurrences during a half-hour observation. 
Although the overall occurrence of the out-of-seat behavior had 
decreased, the range of overlap was too large (from 14 to 18 occurrences) 
to conclude that the time-out procedure was effective. As such, the 
intervention procedure continued until the range of overlap decreased 
further. When the range of overlap between baseline and intervention 
decreased to only two occurrences, the intervention procedure was 
terminated. Conclusions regarding the efficacy of the intervention can be 
made at this point since the overlap is minimal. 
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3. An eg iial number of data points should be included during each phase if 
possible (Barlow & Hersen, 1 973). When baseline and intervention phases 
are of unequal lengths, observed changes may' be due to extraneous 
factors associated with the unequal length of phases. For example, the 
child might receive additional attention during the longer phases, or 
greater effects due to maturation may occur during the longer phases. 
Thus, the length of the intervention condition is dependent upon the 
level at which the target behavior occurred during baseline, the 
variability of the behavior during baseline and intervention conditions, 
and the length of the baseline condition. 

The A-B-A procedure may be an appropriate technique to use when 
documenting the impact of a behavior management intervention . However, the 
A-B-A procedure is not an appropriate technique for instructional interventions. 
Instructional interventions are directed toward increasing the child's ability to 
perform a given skill. As such, a return to baseline levels during a reversal 
condition may not be expected since increases in the child's ability to perform a 
skill may be relatively permanent. 

Figure 2 shows an example of an A-B-A procedure used to document the 
impact of an intervention on the out-of-seat behavior of a handicapped preschool 



FIGURE 2. An example of an A-E-A evaluation procedure used to 
document the impact of intervention on the out-of-seat behavior of a 
child. 
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child. As shown, there was a substantial decrease in the child's out-of-seat be- 
havior during the intervention phase relative to the baseline phase. During the 
reversal condition, the child's out-of-scat behavior increased to the baseline level. 
Since the out-of-seat behavior decreased during the intervention condition and did 
not do so during the baseline and reversal conditions, one may infer that the 
observed decrease during the intervention phase was due to the intervention . 

In most instances, an A-B-A-B procedure is more appropriate th^^n the A-B-A 
procedure {Miller, 1973). With an A-B-A-B design, the intervention or treatment is 
reapplied after the reversal condition. The degree to which the two intervention 
conditions show a similar Impact upon the target behavior, while differing from 
baseline and reversal conditions, increases one's confidence that the intervention 
was effective. V/ith repeated intervention and reversal conditions showing similar 
results, evidence regarding the efficacy of the intervention is accrued. Thus, the 
greater the number of reversals, the greater one's assurance that the intervention 
is effective when the behavior repeatedly returns to baseline levels during the 
reversals. 

The A-B-A-B procedure is regarded as more powerful than the A-B-A 
procedure, in that It allows for two {or more with repeated applications, 
A-B-A-B-A-B) opportunities to demonstrate the effectiveness of the intervention 
{Barlow & Hersen, 1973). An additional advantage of the A-B-A-B procedure is 
that the final condition is an intervention condition which may be continued 
following the evaluation. As such, the disadvantages associated with the withdrawal 
of the intervention are minimized. 

The A-B-A-B procedure may be used with instructional interventions while the 
A-B-A procedure may not. Since skills which have been learned would not be 
expected to return to baseline during a reversal condition, the A-B-A procedure 
would offer little information regarding the impact of Instruction. However, when 
the reversal condition is followed by a reapplication of the instructional intervention 
{as in the A-B-A-B design), comparison can be made between the child's skill gain 
during the baseline and reversal conditions with that during the intervention 
conditions. The degree to which the child demonstrates skill gains during 
Instruction while the baseline and reversal phases remain stable provides evidence 
that the instruction contributed to the change. 

As shown in Figure 3, the child's skill level increased during the instruction 
condition relative to the reversal condition. That the child's skill level increased 
only during the instruction condition suggests that the instruction was responsible 
for the observed change. The inclusion of the second Instruction condition elimi- 
nates the objections which are found with the A-B-A design, i.e., that the 
intervention and reversal condition did not impact upon the target behavior and 
that any observed changes would have occurred with or without the intervention 
and reversal conditions. The degree to which the target behavior changes in a 
similar manner during the intervention condition while remaining stable during the 
baseline and reversal conditions increases one's certainty that the intervention had 
an impact. 

Figure 4 shows an example of an A-B-A-B procedure used to decrease the 
occurrence of an inappropriate behavior. The degree to which the target behavior 
during the second intervention condition approximates the first intervention 
condition increases one's confidence that the intervention was effective. Thus, 
there are essentially two advantages to the A-B-A-B procedure over the A-B-A 
procedure for behavior management interventions: {1 ) the final condition is an 
intervention condition and (2) the second intervention serves as a replication of the 
first intervention which may provide additional information regarding its effective- 
ness. 

An alternative to the A-B-A and A-B-A-B procedures is the B-A-B procedure. 
In the B-A-B procedure, the first condition is an intervention condition {B), 
followed by a non-intervention condition {A). The intervention condition {B) is then 
reapplied. The B-A-B procedure may be desirable when it is not possible to obtain 
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FIGURES, An A-B-A-B design used to document the impact of 
Instruction upon the performance of a child's skill. 
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a baseline measure prior to the application of an intervention. For example, a child 
may exhibit n self-destructive behavior which is very damaging to his vision, in 
such an instance, it would not be appropriite to delay intervention for the 
purposes of collecting baseline data. The use of a B~A-B procedure would be 
appropriate. 

In the absence of a baseline condition, however, i t may not be possible to 
accurately determine the degree to which the intervention changed the occurrence 
of the target behavior from the level at which it would normally occur. An A-B-A~B 
procedure is more desirable when baseline measures may be obtained (Barlow & 
Hersen, 1973). 

Figure 5 is an example of the B~A-B procedure used with a child exhibiting 
self-destructive behavior. As shown, the occurrence of the self-destructive behav- 
ior showed an overall decrease during the first intervention condition. This trend 
reversed during the non-intervention condition, returning to the level at which the 
target behavior occurred when the intervention was first initiated. During the 
second intervention condition, the occurrence of the target behavior again showed 
an overall decrease in a manner similar to that in the first intervention. The 
extent to which the occurrence of the target behavior reversed during the 
non-intervention condition relative to the intervention condition increases one's 
certainty that the Intervention was effective. 



FIGURE 5. An example of a B-A-B evaluation procedure used to docu- 
ment the impact of intervention on a child's self-destructive behavior. 
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•lULTIPLE BASELINE PROCEDURE 



A third, and somewhat more complex, single ubject evaluation procedure is 
the multiple baseline technique (Baer, Wolf, & Risle- , 1968). Tiie multiple baseline 
technique is useful when the target behavior is pcLentially irreversible or when it 
is inappropriate to institute a reversal condition. Implementation of the multiple 
baseline technique involves a series of eight steps: 

1. Identifying two or more target behaviors which are not expected to be 
related. Skills or behaviors which are related or similar, such that an 
intervention applied to one results in changes in the other, are said to be 
of the same response class . Behaviors or skills included in the multiple 
baseline technique must be independent of one another (i.e., different 
response classes). For example, if a parent designs a metl\od to increase 
a child's verbal response to questions, one might also expect that the 
child's total verbal behavior will increase. Consequently, both behaviors 
should not be included in the same multiple baseline study. 

2. Defining the target behaviors in a manner such that two or more ob- 
servers may agree on the occurrence of the behaviors. 

3. Obtaining data to support interobserver agreement on the occurrence of 
the target behaviors. 

^. Initiating concurrent baseline collection procedures for each of the 
identified target behaviors. 

5. Continuing the baseline measures until the absence of trends cjan be 
documented. 

6. Implementing an intervention for one of the target behaviors while con- 
current baseline measures continue for the remaining target behaviors 
(noting any change in the occurrence of the target behavior during, 
intervention and any concurrent changes in the target behaviors still 
under baseline conditions). 

7. Implementing an intervention for a second target behavior while 
concurrent baseline measures continue for any remaining untreated target 
behaviors. Changes in the occurrence of the second target behavior and 
concurrent changes in any of the other target behaviors should be noted. 

8. Continuing to implement intervention in the same manner, one target 
behavior at a time, until all target behaviors are under intervention. 

The strength of the multiple baseline technique is based upon two attributes: 
(1) each of the individual target behaviors follows an A-B procedure which allows 
for a comparison between the baseline and intervention conditions, and (2) control 
measures for the target behaviors under intervention are found in the concurrent 
baseline measures. Thus, the degree to which the occurrence of the individual 
target behaviors change during intervention relative to baseline suggests the 
intervention to be effective. The extent to which such changes occurred in the 
target behaviors under instruction, while the target behaviors under concurrent 
baseline conditions remained stable, further demonstrates the impact of the 

intervention. . . 

The objections to the A-B procedure, :hat changes in the occurrence of the 
target behavior might have occurred witm or without the intervention, are 
substantially decreased with the multiple baseiine technique. If such changes would 
normally have occurred, one would expect similar changes in the concurrent 
baseline measures of the other target behaviors. Thus, when a change occurs for 
a behavior during intervention while the concurrent baselines remain stable, 
evidence is accrued that the intervention is contributing to the effect. This is the 
rationale which mandates that the selected target behaviors be independent of each, 
other (i.e., different response classes). When target behaviors o' similar response 
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classes are utilized,, an intervention applied to one of the behaviors might inad- 
vertently Impact upon the other. As such, the behaviors would co-vary and the 
control characteristics of the target behavior under basoline would be lost. 

There are essentially three types of multiple baseline techniques { Hal! , 
Cristler, Cranston, & Tucker, 1970), each of which follows the same basic pro- 
cedures outlined above. The three types of multiple baseline techniques Include: 

1. Across Behaviors . This procedure involves the selection of two or more 
unrelated behaviors for each child. Intervention is applied to each in 
turn, with concurrent baseline conditions serving as control conditions for 
the intervention conditions. This procedure serves to evaluate the 
relative impact of intervention upon each of the target behaviors for a 
specific child. 

2* Across Settings . The same target behavior is measured concurrently 
across two or more settings (e.g., home vs. school) for the same child. 
Intervention is applied in succession for each of the settings. 
Concurrent baseline measures across settings serve as control conditions 
for the intervention conditions. This procedure serves to evaluate the 
role of the environment In contributing to the occurrence of the target 
behavior. 

3. Across Subjects. The same target behavior Is measured concurrently 
across two or more children. One child receives intervention while 
concurrent baseline data are collected for the remaining children. 
Intervention is ther» applied in succession to the remaining cliildren while 
concurrent baseline data are collected. This procedure controls for the 
influence of subject variables as they relate to the Intervention. 

Multiple baseline techniques may be of particular value In educational settings 
in that a reversal condition is not necessary. Since instructional interventions are 
directed toward increasing the child's ability to perform a given skill In a relatively 
permanent manner, a return to baseline levels during a non-intervention condition 
may not be expected. Further, the multiple baseline technique may not be subject 
to the objections found in reversal evaluation procedures (that It may be Inappro- 
priate or unethical to withdraw intervention for a given target behavior). 
However, the multiple baseline technique does not demonstrate behavioral control as 
directly as reversal procedures since the impact of intervention is inferred from 
Independent behaviors (Hersen & Barlow, 1976). Additional evidence may be 
obtained by including an A-B-A-B reversal procedure for one or more of the target 
behaviors in the multiple baseline technique {Kazdin 5 Kopel, 1975). 

Figure 6 shows a simplified graphic representation of the multiple baseline 
design used to evaluate the Impact of intervention with one child and three target 
behaviors (i.e. , an across behaviors design) . As shown, concurrent baseline 
measures were obtained for each of the three target behaviors. Intervention was 
applied to one of the target behaviors and a subsequent increase in the child's skill 
performance was noted. Concurrent baseline measures were also obtained for the 
remaining target behaviors while noting changes in their occurrence. Intervention 
was then applied to the second target behavior and subsequent increases in the 
child's performance of the skill were noted. A concurrent baseline was obtained for 
the third target behavior and change in its occurrence was noted. 

Examination of Figure 6 shows that changes in the occurrence of the target 
behaviors are consistently greater during the intervention conditions than during 
the baseline conditions. The degree to which the occurrence of the behaviors 
changed during intervention relative to the respective baseline measure for each 
behavior (as in the A-B procedure) suggests the instruction to have been effective. 
The extent to which the occurrence of the target behaviors increased while the 
concurrent baseline measures remained comparatively stable further increases one's 
confidence that the intervention contributed to the observed increase. 
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THE PRESENTATION AND INTERPRETATION OF RESULTS 



The primary means of interpreting th'i results of tlie single subject design is 
through visual analysis of graphically presented data. Graphic presentation of the 
data allows the impact of an intervention technique to be easily demonstrated by 
illustrating that change in the target behavior occurred simultaneously with the 
initiation of the intervention. Although a variety of statistical procedures are 
available to aid in the interpretation of the data (see Gottman 5 Glass, 1978; Levin, 
Marasciulo & Hubert, 1978), visual appraisal of the data Is most generally utilized. 
With visual analysis, the change in the behavior across sessions or conditions must 
be large enough to be visible to the eye. As Sidman (1960) noted, when using 
visual analysis to interpret data, interventions must produce "large effects" in 
order to be considered successful. Whereas a statistical comparison of behavior 



FIGURE 6. Example of the multiple baseline technique used to document 
the impact of three instructional programs. 
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under baseline and t^e^ nent conditions may indicate differences, a visual anal- 
ysis of the data may t. Visual analysis is therefore the more conservative 
approach {Glass, Wilsoi : Gottman, 1975; Parsonson & Baer, 1978). As Jones, 
Welnrott and Vaught {19 . demonstrated, visual analysis results in fewer errors of 
incorrectly attributing ange In behavior to the intervention. In addition, the 
clinical relevance of change In behavior across conditions which cannot be detected 
visually Is questionable. 

Data are usually graphed throughout the course of a single subject procedure. 
Thus, changes can be made In the Intervention as the study proceeds, providing a 
degree of adaptabllitv necessary in applied research. A standard format Is used to 
graph behavioral data. The vertical axis {ordinate) Is used to indicate the 
occurrence of the target behavior, while chronological time {the Instructional session 
or the day) is plotted along the horizontal axis {abscissa) {see Figure 7). The 
occurrence of the target behavior may be expressed In the form of frequency, rate, 
duration, or percentage. The form of presentation depends upon the measurement 
method used to collect the data and the goals of the teacher. 

Line graphs and bar graphs are frequently used in data presentation. In a 
line graph, discrete points are plotted and then joined with a line, as in Figure 7. 
Line graphs are appropriate when the time dimension is continuous and when the 
distances between points on the abscissa are equal. However, when the time 
between sessions is unknown, a bar graph is more appropriate {Parsonson & Baer, 
1978). An example of a bar graph is in Figure 8. Additional information regarding 
the graphing of data from a single subject design Is available in Cooper {1974), 
Koorland and Martin {1975), Meyers (1970), and Spear {1969). 

Three properties of the data aid the instructor in an evaluation of the ef- 
fectiveness of the in*-' ention. These are: {a) trend in the target behavior; 



FIGURE 7. An example of a line graph. 
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(b) the level of the behavior; and (c) the variability of the behavior. Visual 
analysis of the data includes determination of changes in trend, ievei and variability 
of the target behavior between conditions or phases, ' 

TREND, A trend line, as represented by the solid line in Fi ure 7, 
represents the general trend of the daLi. Trend lines may be either dr^^wn free 
hand or by using statistical methods (see Parsonson & Baer, 1978). !f done 
carefully, the free-hand method is adequate for use by teachers in an evaiLiation of 
instructional techniques, A comparison of the slope of the trend Iine5 during 
different conditions of the evaluation procedure yields information regarding the 
efficacy of the intervention. Both the amouht and direction of the changes in trend 
provide information to the teacher. For example, during the baseline condition in 
Figure 7 there is an upward trend in the data. The slope of the trend line during 
the treatment condition changes dramatically, indicating that the intervention was 
effective. Changes of trend in the data within an experimental condition also 
provide information to the teacher regarding the efficacy of the intervention 
(Parsonson &' Baer, 1978), 

LEVEL. Changes in the level of the occurrence of the target behavior 
between conditions are also a good indication that the intervention is effective. All 
of the data points within a condition are averaged to provide an estimation of the 
level of behavior during a particular condition. The level of the target behavior is 
indicated by the dashed line in Figure 7. As can be seen from the data, there is a 
small but visible difference in the level of the target behavior between th^ baseline 
condition and the treatment condition and a larger difference between the treatment 
condition and the reversal condition. These differences, therefore, indicate that 
the intervention is effective. 



FIGURE 8. An example of a bar graph. 
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VARIABILITY. A study of the variability between d:^ta points within and 
between conditions is also made by the teacher in analyzing the data. As pre- 
viously mentioned, a stable baseline is preferred. St^^ble baselines usually provide 
a better basis for attributing change in the target oehavior to the intervention. 
However, variable baseline data as compared with stable data during the treatment 
condition may be indicative that the treatment is effective. The elimination of 
variability may, in fact, be the goal of the treatment. Alternatively, an effective 
treatment may override any variability dufing the baseline condition. The 
occurrence of variability during the baseline condition may also provide the teacher 
with impurtant Information about extraneous factors affecting the occurrence of the 
target behavior. The identification and control of these factors could have im- 
portant treatment ramifications. 



SUMMARY 



It has been argued that the single subject design can provide the means 
necessary to determine the impact of the treatment nnd instructional techniques 
used with handicapped children. One of the primary advantages of the single 
subject procedure is that it is relatively simple and analysis of the data does not 
require statistical sophistication. Thus, a teacher wishing to assess the efficacy of 
an instructional program needs only to follow a few steps. First, the behavior 
which the teacher wants to change must be selected and defined. The precision 
and accuracy of the definition can then be checked through analyzing the agreement 
between two people regarding the occurrence of the target behavior while 
simultaneously and independently observing the same child. Next, a baseline of the 
behavior is obtained. After the baiseline is stable, the teacher can initiate the new 
instructional program. The behavior is measured continuously throughout the 
entire baseline and intervention process. 

The teacher may select one of several designs to aid in the interpretation of 
the efficacy of the instructional method. The A-B-A design may not be used often 
since it may not always be appropriate or ethiral . to institute a reversal condition 
without reinstating the intervention. As such, the A-B-A-B and B-A~B designs 
may be more useful since they call for the intervention to be reinstated during the 
final phase. Since it is seldom appropriate to reverse classroom behaviors, as in a 
reversal design, most teachers will find multiple baseline designs to be the most 
appropriate. The B-A-B design may be chosen when it is not possible to obtain a 
baseline of the behavior prior to the intervention. This may occur when the 
evaluation procedure is started after the instructional program has begun or when 
it is necessary to begin an intervention immediately. Conversely, the multiple base- 
line design requires that baselines are obtained^^on several behaviors prior to ini- 
tiating programs which may change these behaviors. This latter design is 
particularly applicable when a teacher wishes to change more than one behavior. 
The choice of the design is thus determined by the behavior to be modified, the 
instructional techniques to be evaluated, and the goals of the teacher. 

Not only is the methodology of the single subject procedure simple, .but it is 
also well suited to P.L. 9^1-142. The law requires that each child have an 
individualized education plan, based on the child's needs, that is continuously 
monitored and updated. The emphasis of the single subject design is on the 
individual. In addition, ongoing collection, graphing and interpretation of the data 
provide the teacher with continuous information regarding the child's progress so 
that the instructional plan can be changed when necessary. 
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